Real-Time Keyword Extraction from Conversations

نویسندگان

  • Michalis Vazirgiannis
  • Antoine J.-P. Tixier
  • Polykarpos Meladianos
  • Giannis Nikolentzos
چکیده

We introduce a novel, fully unsupervised method to extract keywords from meeting speech in real-time. Our approach represents text as a word co-occurrence network and leverages the k-core graph decomposition algorithm and properties of submodular functions. We outperform multiple baselines in a real-time scenario emulated from the AMI and ICSI meeting corpora. Evaluation is conducted against both extractive and abstractive gold standard using two standard performance metrics and a newer one based on word embeddings.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Diverse Keyword Extraction from Conversations

A new method for keyword extraction from conversations is introduced, which preserves the diversity of topics that are mentioned. Inspired from summarization, the method maximizes the coverage of topics that are recognized automatically in transcripts of conversation fragments. The method is evaluated on excerpts of the Fisher and AMI corpora, using a crowdsourcing platform to elicit comparativ...

متن کامل

Document Recommendation for Conversation Based on Keyword Extraction and Clustering

Through this project we are extracting appropriate keyword from conversation input. Extracted keywords are matched with available documents. Finally, we recommend appropriate documents to the participants for reference. It also represents the problem faced during keyword extraction in conversation using automatic speech recognition (ASR) system which brings errors in result. In order to overcom...

متن کامل

Search Engine Optimization for Threaded-Conversations

Online discussion communities are becoming increasingly popular among web users, where an extensive amount of discussion and commenting takes place. However, it is difficult to search these conversations as search engines are not optimized for the conversation-structure of online communities. In this paper, we purpose a method of ranking search results based on the conversation-structure and us...

متن کامل

A System For Searching And Browsing Spoken Communications

As the amount of spoken communications accessible by computers increases, searching and browsing is becoming crucial for utilizing such material for gathering information. It is desirable for multimedia content analysis systems to handle various formats of data and to serve varying user needs while presenting a simple and consistent user interface. In this paper, we present a research system fo...

متن کامل

Polypus: a Big Data Self-Deployable Architecture for Microblogging Text Extraction and Real-Time Sentiment Analysis

In this paper we propose a new parallel architecture based on Big Data technologies for real-time sentiment analysis on microblogging posts. Polypus is a modular framework that provides the following functionalities: (1) massive text extraction from Twitter, (2) distributed non-relational storage optimized for time range queries, (3) memory-based intermodule buffering, (4) real-time sentiment c...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017